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HAPLOTYPE ANALYSIS 

BACKGROUND OF THE INVENTION 
[001] Single nucleotide polymorphism (SNP) analysis has drawn much attention 
with the hope of identifying genetic markers for common diseases. However, current 
genotyping technologies are only able to determine each SNP separately. In other words, 
there is a lack of information on how several SNPs are associated with each other 
physically on a chromosome. A DNA haplotype, the phase determined association of 
several DNA markers (e.g. SNPs) is a much more powerful method statistically for 
disease association studies. Yet unfortunately, it is also much harder to determine a 
haplotype. Current experimental approaches include a physical separation of homologous 
chromosomes via means of mouse cell line hybrid, cloning into a plasmid and allele 
specific PCR. Neither of them is simple enough a method for routine high-throughput 
analysis. There are also ways to computationally determine haplotypes, but the accuracy 
of such computational analysis is uncertain. 

[002] Haplotypes, combinations of several phase-determined polymorphic markers 
in a chromosome, are extremely valuable for studies like disease association 1,2 and 
chromosome evolution. Direct molecular haplotyping has relied heavily on family data, 
but is limited to short genomic regions (a few kilobases). Statistical estimation of 
haplotype frequencies can be inconclusive and inaccurate 3 . It would be desirable to have 
a method that enables direct molecular haplotyping over longer genomic regions. 
[003] With the rapid discovery and validation of several million single nucleotide 
polymorphisms (SNP), it is now increasingly practical to use genome-wide scanning to 
find genes associated with common diseases ,l2 . However, individual SNPs have limited 
statistical power for locating disease susceptibility genes. Haplotypes, combinations of 
several phase-determined polymorphic markers, can provide additional statistical power 
in the mapping of disease genes Arl . 

[004] Haplotype determination of several markers for a diploid cell is complicated 
since conventional genotyping techniques cannot determine the phases of several 
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different markers. For example, a genomic region with three heterozygous markers can 
yield 8 possible haplotypes. This ambiguity can, in some cases, be solved if pedigree 
genotypes are available. However, even for a haplotype of only 3 markers, genotypes of 
father-mother-offspring trios can fail to yield offspring haplotypes up to 24% of the time. 
Computational algorithms such as expectation-maximization (EM); subtraction and 
PHASE are used for statistical estimation of haplotypes 4 ***. However, these 
computational methods have serious limitations in accuracy, number of markers and 
genomic DNA length. For example, for a haplotype of only 3 markers from doubly 
heterozygous individuals, the error rates of the EM and PHASE methods for haplotype 
reconstruction can be as high as 27% and 19%, respectively 3 . Alternatively, direct 
molecular haplotyping can be used based on the physical separation of two homologous 
genomic DNAs prior to genotyping. DNA cloning, somatic cell hybrid construction, 
allele specific PCR and single molecule PCR l0 * 12 have been used, and these approaches 
are largely independent of pedigree, information. These methods are limited to short 
genomic regions (allele-specific PCR and single molecule PCR) and are prone to errors. 

SUMMARY OF THE INVENTION 
[005] The present invention provides an efficient way for high throughput haplotype 
analysis. Several SNP markers can be simultaneously determined through multiplex PCR 
of single DNA molecules. These SNPs can be of any distance to each other on the 
chromosome, something almost impossible to achieve for other methods. In addition, our 
overlapping SNP approach can be used to link smaller haplotypes into larger haplotypes. 
The invention provides a powerful new tool for diagnostic haplotyping. 
[006] The method of the present invention enables direct molecular haplotyping of 
several polymorphic markers separated by as far as about 25 kilobases (kb). In short, 
genomic DNA is first diluted to approximately single copy. The haplotype is 
subsequently determined by genotyping several polymorphic markers in the same 
reaction with multiplexed PCR and base extension reactions. This approach does not rely 
on pedigree data and does not require previous amplification of the genomic region 
containing the selected markers. 

{007] In one embodiment, the invention provides a method of haplotyping 
comprising the steps of identifying at least two SNPs within a genomic region; obtaining 
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a DNA sample; amplifying the DNA sample comprising more than one copy of the 
genomic DNA with at least two primer pairs each capable of amplifying a different 
region flanking each of the SNPs in separate reactions and individually genotyping the 
amplified DNA; diluting the DNA sample so that substantially only one copy of a 
genomic DNA is present in the dilution; amplifying the diluted DNA sample using the 
same primer pairs and performing a multiplex genotyping of the amplified DNA; 
determining the haplotype of the one copy of the genomic DNA genotyping the amplified 
sample from the single genomic DNA copy dilution. In a preferred method the 
genotyping is performed using primer expension, terminator nucleotides and MALDI- 
TOF analysis. 

[0081 111 another embodiment, the invention provides a method of haplotyping 
comprising obtaining a genomic DNA sample; diluting the DNA sample so that 
substantially only one copy of a genomic DNA is present in the dilution; identifying at 
least two SNPs within a genomic region; amplifying the diluted DNA sample using 
primer pairs designed to amplify a genomic region flanking the SNPs; and determining 
the haplotype of the copy of the genomic DNA by multiplex genotyping. 
[0091 In a preferred embodiment, the methods of the present invention use mass 
spectrometry, for example MassArray system, to genotype the samples. 
[010J Preferably at least two, more preferably at least three, four or five primer pairs 
are used simultaneously to amplify the polymorphic genomic regions. In one 
embodiment, at least ten primer pairs are used in the same reaction. In one embodiment, 
the haplotype is determined using at least two different combinations of primer pairs in at 
least two parallel reactions. 

BRIEF DESCRIPTION OF FIGURES 
[Oil] Figures 1 A-1B show a flow chart of multiplex genotyping of single DNA 
molecules for haplotype analysis. Traditional genotyping methods using a few nano- 
grams (ng) genomic DNA (about 1600 copies of genomic templates) yield only the 
genotypes of each individual SNP marker, but the phases of these SNPs are not 
determined (shown in top right in the mass spectra in Fig. 1 A). Simultaneous genotyping 
of several markers using multiplex assays with single DNA molecules (Fig. IB) allows 
haplotyping analysis since the two alleles can be physically separated with very dilute 
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DNA concentrations (shown in bottom right in the mass spectra in Fig. IB). In contrast 
to other molecular haplotyping methods, the entire haplotype block is not amplified in 
this approach. Instead, only about 100 bp around each individual SNP is amplified for 
genotyping, resulting in very high efficiency of PGR amplification from single DNA 
molecules. The SNP markers can be as far apart as desired, as long as there is no 
significant break between them. 

[012] Figure 2 shows effects of genomic DNA concentration on haplotyping 
efficiency. About 3 pg, 5 pg and 9 pg (or 1, 1 .6 and 3 copies of genomic templates) were 
used for haplotyping of 3 SNP markers in the CETP region. The DNA copy number in a 
specific reaction is estimated by the Poisson distribution. The haplotyping result can 
either be a failed assay, successful haplotyping, both alleles present (no phase 
determination for the markers), or an incomplete multiplex. Except for incomplete 
multiplexes, values are percentages from 54 to 144 individual multiplex assays (see 
methods section for details on the calculation), followed by predicted values using the 
Poisson distribution. 

[013] Figure 3 shows overlapping multiplex genotyping assays with single DNA 
molecules. Seven SNP markers (A: rs289744, B: rs2228667, C: rs5882, D: rs5880, E: 
rs5881, F: rs291044, G: 2033254) from an 8kb genomic region of the CETP locus were 
chosen (details of these SNPs, their chromosome position and oligonucleotides used for 
genotyping are provided in Table 2). Two 5-plex genotyping assays were designed for 
these 7 markers and the overlapping heterozygous SNPs were used to obtain the entire 
haplotype of 7 SNP markers. Assays on individual 6 were used to demonstrate how this 
is carried out. Multiplex assay 1 determined the haplotype of 5 SNPs as AGAGT and 
CGGGC. Multiplex assay 2 determined the other haplotype of 5 SNPs as GGGCT and 
AGGTT. Then, the genotypes of the overlapping SNPs (SNP C, E, F) were used to 
combine the two 5-SNP haplotypes into a haplotype of 7 SNPs covering the entire region 
under investigation. 

DETAILED DESCRIPTION OF THE INVENTION 
[014] The present invention provides a direct molecule haplotyping approach which 
is based upon a surprising discovery that a single molecule dilution of genomic DNAcan 
be used for separation of two homologous genomic DNAs. The diluted sample is used in 
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direct multiplex genotyping of several markers with, for example, the MassArray™ 
system (Fig. 1). The approach of the present invention differs significantly from previous 
single molecule PCR method in that the method of the present invention does not require 
the amplification of the complete genomic region containing the markers of interest; thus 
it is not limited to only a few kb DNA. Close to 100% genotype success rates for single 
DNA molecules has been achieved. Additionally, the multiplex genotyping assay 
approach enables direct haplotype determination without pedigree genotype information. 
High throughput haplotyping can easily be achieved by incorporating the method of the 
present invention with any commercially available genotyping systems, such as the 
MassArray™ system. 

[015] The invention provides a method of creating haplotypes comprising the steps 
of obtaining a DNA sample; amplifying the DNA sample comprising more than one copy 
of the genomic DNA with at least two primer pairs designed to amplify a genomic region 
containing a nucleic acid polymorphism and genotyping the amplified DNA; diluting the 
DNA sample so that substantially only one copy of a genomic DNA is present in the 
dilution; amplifying the diluted DNA sample using at least two primer pairs designed to 
amplify a genomic region containing a nucleic acid polymorphism and genotyping the 
amplified DNA; determining the haplotype of the one copy of the genomic DNA. 
[016] Genomic DNA. Genomic DNA can be isolated from a subject using any 
method of DNA isolation known to one skilled in the art Examples of DNA isolation 
methods can be found in general laboratory manuals, such as Sambrook and Russel, 
MOLECULAR CLONING: A LABORATORY MANUAL, 3rd Ed, Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, N.Y. (2001), the entirety of which is herein 
incorporated by reference 

[017] Polymorphic Markers and Oligonucleotides. The number of polymorphic 
nucleic acid useful according to the present invention is ever increasing. Currently, such 
markers are readily available from a variety of publicly accessible databases and new 
ones are constantly being added to the pool of available markers. Particularly usefiil 
markers according to the present invention are single nucleotide polymorphisms (SNP). 
Examples of SNP databases include, but are not limited to Human SNP Database at 
http://www-genome.wi.mit.edu/snp/human, NCBI dbSNP Home Page at 
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http://www.ncbi.nlm.nih. gov/SNP. 

http://lifesciences.perkinelmerxom/SNPDatabase/welcome.asp, Celera Human SNP 
database at 

http://vmwxeIera.com/genomics/academic/homexfi3i7ppag the SNP 

Database of the Genome Analysis Group (GAN) at http://www- 
gan.iarc.fr/SNPdatobase.htmh 

[018] A number of nucleic acid primers are already available to amplify DNA 
fragments containing the polymorphisms and their sequences can be obtained, for 
example, from the above-identified databases. Additional primers can also be designed, 
for example, using a method similar to that published by Vieux, E.F., Kwok, P-Y and 
Miller, R. D. in BioTechniques (June 2002) Vol. 32. Supplement: "SNPs: Discovery of 
Marker Disease, pp. 28-32. 

[019] Genotyping. A number of different single nucleotide polymorphism (SNP) 
genotyping methods useful according to the present invention are known to one skilled in 
the art. Generally, SNP genotyping is performed after amplification of the genomic DNA 
samples using, for example polymerase chain reaction (PCR). For example, SNP 
genotyping by MALDI-TOF mass spectrometry can performed using, for example, the 
Sequenom mass spectrometry system. In this method, multiplexed PCR is performed 
using more than one primer pairs each flanking different SNPs and then a 
minisequencing primer extension reaction performed in a single well using chain 
terminator nucleotides. The size of reaction products is determined directly by MALDI- 
TOF mass spectrometry, yielding the genotype information. Multiplexing permits 
determination of, for example, up to about 5 SNPs in a single well of a 384 well plate. 

[0201 Alternatively, fragment analysis for SNP detection can be performed on 
batches of several samples on a capillary electrophoresis system, for example an ABI 
PRISM® 3100 GENETIC ANALYZER (Applied Biosystems, Foster City, CA). For 
capillary electrophoretic analysis, the primers can be labeled using dyes, including, but 
not limited to FAM, HEX, NED, LIZ, ROX, TAMRA, PET and VIC. 

[0211 Single SNP allelic discrimination can further be carried out using the ABI 
PRISM® 7900HT Sequence Deatection System (Applied Biosystems, Foster City, CA), 
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which allows analysis of single nucleotide polymorphisms (SNPs) using the fluorogenic 
5* nuclease assay. 

[022] Yet another available method useful according to the present invention is an 
Arrayed Primer Extension (APEX) which is a resequencing method for rapid 
identification of polymorphisms that combines the efficiency of an microarray-based 
assay (alternative to gel-based methods, see, e.g. U.S. Patent No. 6,153,379 and 
Shumaker et al. Hum. Mutat. 7(4):346-354, 1996) with the Sanger nucleic acid 
sequencing method (Sanger et al., Proc. Natl. Acad. Sci. 74:5463-5467 (1977)). 
Generally, microarrays are microchips, for example glass slides, containing thousands of 
DNA segments in an ordered array, witch allows the simultaneous analysis of thousands 
of genetic markers. 

[023] A yet another genotyping method useful according to the present invention is 
a solid-phase mini-sequencing technique, which is also based upon a primer extension 
reaction and can be used for genotyping of SNPs and can also be easily automated (U.S. 
Patent No. 6,013,431, Suomalainen et al. Mol. BiotechnoL Jun; 15(2): 123-31, 2000). 
[024] In general, a single-base extension reaction is a modified cycle sequencing 
reaction in which only dideoxynucleotides (terminators) are present. When a terminator is 
incorporated onto a DNA strand, no further extension can occur on that strand. In a 
standard cycle sequencing reaction, terminators are present only in small concentrations 
along with high concentrations of typical nucleotides. In the single base extension 
reactions for SNP assays, two or more fluorescently labeled terminator nucleotides 
(corresponding to the two or more alleles present at the SNP to be typed) are used. 
[025] Haplotype analysis. Haplotype is defined as a combination of alleles or 
single nucleotide polymorphisms (SNPs) of closely linked loci that are found in a single 
chromosome which tend to be inherited together. For a specific gene segment, there are 
often many theoretically possible combinations of SNPs, and therefore there are many 
theoretically possible haplotypes. 

[026] Information about gene flow in a pedigree can be used to reconstruct likely 
haplotypes for families and individuals. In one embodiment, the present invention 
provides a method wherein a set of SNPs are first identified, each polymorphic SNP 
marker from a subject is then genotyped individually, for example using about 1-10 ng, 
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preferably about 5 ng genomic DNA. The genomic DNA sample is then diluted into 
about 1 copy of genomic template per dilution. The haplotype is determined by 
determining the SNP's in the diluted sample. For haplotype analysis, multiplex 
genotyping assays using, for example, primers which anneal directly before the SNP and 
can be extended using an extension primer and chain terminating nucleotides, are 
consequently carried out using about 3 pg or approximately 1 copy of genomic template, 
unless otherwise specified genomic DNA. 

[027] Alternatively, the dilution of single copy of genomic template is performed 
directly and the SNPs are haplotyped without prior genotyping of the samples. In one 
embodiment, the genotyping is used as a quality control for the amplification reaction. 

[028] The genomic region to be haplotyped using the method of the present 
invention is preferably at least about 8 kb, at least about 10 kb or more, at least about 1 5 
kb or more, at least about 20 kb or more, at least about 25 kb or more, at least about 35 kb 
or more and at least about 40 kb or more. Most preferably the genomic region is about 
25 kb ore more. 

[029] In determining the haplotypes, the genotyping reactions are multiplexed which 
term is meant to include combining at least two, preferably more than three, four five or 
up to about 1 0 extension primers in the same reaction to identify, preferably more than 3, 
4, 5, or up to at least 10 SNPs in the same genotyping reaction. 
EXAMPLE 

[030] We first investigated the effects of genomic DNA concentration on 
haplotyping efficiency. We used 3 picograms (pg), 5 pg and 9 pg (equivalent of 1, 1.6 
and 3 genomic template copies) of genomic DNA for PCR amplification and genotyping 
of 3 SNPs in the CETP region from 12 individuals. Each 3-plex assay was repeated 12- 
1 8 times to evaluate the PCR and haplotyping efficiency. A typical assay result is 
summarized in Table 1. The copy number of the genomic DNA region of interest for 
very dilute DNA solutions is estimated by the Poisson distribution 13 . Haplotyping 
results are categorized into 4 groups (Table 1). Failed assays can result from either failed 
PCR amplification from single copy DNAs or simply no template present due to 
stochastic fluctuation of very dilute DNA solutions. Partially failed genotyping calls (or 
incomplete multiplexes) are those that have only 1 or 2 SNPs successfully genotyped. 
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This is most likely due to unsuccessful PCR for 1 or 2 of the SNP DNA regions, since in 
most cases the 3 SNP markers are present or absent at the same time due to the close 
proximity of the SNP markers (< 628 bp). Poisson distribution may also result in the 
presence both alleles in the solution and hence the inability to resolve the phase of the 
SNPs. Successful haplotyping analysis can be achieved when a single copy of the allele 
or multiple copies of the same allele are present and the genotyping is successful. 
[031] Incomplete multiplex genotyping can be used to estimate the efficiency of 
genotyping from single copy DNA molecules. A partial genotyping call suggests the 
presence of the SNP DNA but a failure to genotype some of the SNPs. We typically 
observe 5-10% incomplete multiplex genotyping calls (Fig. 2), suggesting a PCR 
efficiency of 90-95% with single DNA molecules. This approach may overestimate the 
PCR efficiency, since we did not take the completely failed assays into account. We also 
carried out detailed comparison between observed and theoretical values of failed assays, 
successful haplotyping and the presence of both alleles (Fig. 2 and see methods section 
for details of calculation). Theoretical values are based on the Poisson distribution of 
very dilute DNA solutions and the assumption of 100% PCR amplification efficiency. 
The close agreement between theoretical estimate and experimental observation 
substantiates the earlier estimate of extremely high PCR efficiency with single DNA 
molecules. This high PCR efficiency is mainly due to the high efficiency of 
amplification of very short amplicons (typically 100 bp) and the high sensitivity of 
MALDI-TOF mass spectrometric detection of DNA oligonucleotides. This high PCR 
efficiency is absolutely crucial for high-throughput haplotyping analysis. With our 
current PCR efficiency, we can achieve 40-45% haplotyping efficiency with one single 
reaction using 3-4.5 pg genomic DNA. A replicate of 4 independent multiplex 
genotyping assays will enable about 90% of direct haplotyping efficiency. 
[032] We next demonstrate an approach for determining haplotypes where there are 
too many markers to be determined in one multiplex genotyping assay. Overlapping 
informative SNPs can be used to combine haplotypes from several multiplex assays. Six 
SNP markers in an 8kb CETP genomic region were chosen, and 2 overlapping 4-plex 
genotyping assays were used for haplotyping analysis (Fig. 3). We were able to 
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determine the haplotypes of all 12 individuals for that genomic region, with absolutely no 
optimization of the assay system. 

[033] The approach presented here provides a powerful and unique technology 
platform for direct molecular haplotyping analysis of long-range genomic regions. This 
approach is completely independent of pedigree genotype information. We have also 
incorporated this technique with the commercially available MassArray™ system for 
high-throughput applications. This technology can be extremely useful in large-scale 
haplotyping and haplotype-based diagnostics. 

[034] Genomic DNAs and oligo nucleotides. Human genomic DNA samples used 
for haplotyping of the CETP locus were provided by SEQUENOM Inc. (San Diego, CA). 
These DNAs were isolated using the Puregene DNA isolation kit (Gentra Systems) from 
blood samples purchased from the Blood Bank (San Bernadino County, CA). The 
personal background of the blood donors is not accessible for these samples. Human 
genomic DNAs samples for haplotyping of a 25kb segment on chromosome 5q31 were 
CETP family DNAs purchased from Coriell Cell Repositories (see Table 3). Information 
on SNPs and oligonucleotides for genotyping is provided in Table 2. 
[035] Genotyping and haplotyping analysis. Genotyping analyses were carried 
out using the MassArray™ system (SEQUENOM Inc.). Each SNP from every individual 
was first genotyped individually using 5 ng genomic DNA. For haplotyping analysis, 
multiplex genotyping assays were carried out using 3 pg (or approximately 1 copy of 
genomic template, unless otherwise specified) genomic DNA. 
[036] Analysis of effects of genomic DNA concentration on haplotyping. To 
calculate the percentage of failed assays, we simply counted all failed assays (no calls for 
either SNP), divided by the total number of assays. We typically do 12 to 1 8 replicates 
for each 6 or 12 individuals. The percentage of incomplete assays is calculated in the 
same way. To calculate percentage of successful haplotyping and both alleles, we 
excluded the data from those individuals with homozygous haplotypes. Theoretical 
predictions are based on the Poisson distribution of very diluted DNA solutions, 
according to a published method 13 . 
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Genotvoe Calls 


1 


GGC b 


2 


GGC 






» 


-GC d 






A 
U 


GGC 


7 




S 


ACA 


9 


-GC ' 


10 


A/GC/GA/C C 


11 


ACA 


12 


ACA 



a Genotypes of 3 SNP markers were determined with triplex assays from 
3 pg genomic DNA. 

b The 3 SNPs are G, G, C genotype respectively. 
c Failed to genotype any of the 3 SNPs. 

d Failed to genotype the first SNP, the rest two SNPs are G and C 
respectively. 

bailed to separate the two alleles, thus the genotypes are A/G, C/A and 
A/C for the 3 SNPs. 



Table 2. Single nucleotide polymorphism (SNP) markers; their chromosomal locations, 
primer pairs to amplify the markers and terminator mixes used in the reaction. 

Ter 

Chrom. "J. 
SNP ID Position PCR primer 1 PCR primer 2 Extension Primer Mix 

472826 

rs289741 25 TCTACCAGCTTGGCTCCCTC AAGTCCATCAGCAGCAGCAG GGGAGTCAGCCCAGCTC AC" 

472823 

rs289742 37 ACTGGTGAGACAATCCCTTC CCACTGGCATTAAAGTGCTG AGCCACAGAAGAAGGACTCC AC" 

472819 

rs289744 97 TACCAGAAACCAGACCTCTG AGTGCTGGACAGAAAGTGAG TGAGGATGGTGGGAGGG AC 

472819 

rs289744° 97 TCTACCAGAAACCAGACCTC AGTGCTGGACAGAAAGTGAG ACCTCTGAGGGCCCCTTAC CG 

472828 

rs2228667 a 20 CTCGAGTGATAATCTCAGGG AGGTAGTGTTTACAGCCCTC TGATGATGTCGAAGAGGCTCATG CO 

472840 

rs5882 a 07 TTACGAGACATGACCTCAGG GCATTTGATTGGCAGAGCAG CTGCAGGAAGCTCTGGATG CC 

472840 

rs5882 b 07 GCATTTGATTGGCAGAGCAG TTACGAGACATGACCTCAGG AGAGCAGCTCCGAGTCC AC 
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472850 






08 


GCAGCACATACTGGAAATCC 




472880 




rs5881 a 


87 


AGATCTTGGGCATCTTGAGG 


472880 




re5ftfl1 b 
1 MOO 1 


87 


ACCCCTGTCTTCCACAGGTT 




472886 




rs291044 a 


47 


GTAAAACTGCAGCTGAGGAG 


472886 




rs291044 b 


47 


TGACTAGGTCAGGTCCCCTC 




472901 




fS2033254 b 


14 


GGACATCAAAGGAACAGGAC 


1GR2198A_ 


506266 




1 c 


d 


GGGTTGCATGAGCATTAAGT 


IGR2175A. 






2 


495082 


TGGCCTTGATTCAAACCCTG 


IGR2150A_ 






1 


482171 


CCTTGGCTTGATAGTCAAAC 



TTTCTCTCCCCAGGATATCG 
ACCCCTGTCTTCCACAGGTT 
AGATCTTGGGCATCTTGAGG 
TGACTAGGTCAGGTCCCCTC 
GTAAAACTGCAGCTGAGGAG 
ACTCACAATATTGGGCAGGC 

CACATCAAGGATAAGACTGC 
AGATGAAGGAAATCCCAAGG 
ATTTGGAGGAGTGCAGAGAG 



GCTTTTTCTTAGAATAGGAGG ACT 

TGGGCCTGGCTGGGGAAGC CG 

TGTCTTCCACAGGTTGTCGGC ACT 

GGAGTATTTAAAGGAGAGACACACTAG CG 

CCCTCGTGCCACAGCCT ACT 

CAAGGGGCTAAGGGAGAAG ACT 

ATCTCTTCAGTAGACGAAC AC 

TGCCACTAACATACATAGTAAC AC 

AGTCAAACTCTCACCAC AC 



°Mufflp!exGroupa 
Multiplex Group b 
'SNP ID from ref 
''Position of SNP from ref 



Tenn. Mix = terminator nucleotide mix. Chrom. Position = chromosomal position 



Table 3. DN A samples used in the Example. 



Repository 
Number 



Sample Type Sample Description 



Relation 



GM12547 Lymphoblast CEPH/FRENCH PEDIGREE 66 father 

GM12548 Lymphoblast CEPH/FRENCH PEDIGREE 66 mother 

GM12549 Lymphoblast CEPH/FRENCH PEDIGREE 66 son 

GM12550; Lymphoblast CEPH/FRENCH PEDIGREE 66 daughter 

GM12551 Lymphoblast CEPH/FRENCH PEDIGREE 66 daughter 

GM12552 Lymphoblast CEPH/FRENCH PEDIGREE 66 son 

GM12553 Lymphoblast CEPH/FRENCH PEDIGREE 66. daughter 

GM12554 Lymphoblast CEPH/FRENCH PEDIGREE 66 daughter 

GM125S5 Lymphoblast CEPH/FRENCH PEDIGREE 66 son 

GM12556 Lymphoblast CEPH/FRENCH PEDIGREE 66 paternal grandfather 

GM12557 Lymphoblast CEPH/FRENCH PEDIGREE 66 paternal grandmother 

GM125S8 Lymphoblast CEPH/FRENCH PEDIGREE 66 maternal grandfather 

GM12559 Lymphoblast CEPH/FRENCH PEDIGREE 66 maternal grandmother 

GM07038 Lymphoblast CEPH/UTAH PEDIGREE 1333 father 

GM06987 Lymphoblast CEPH/UTAH PEDIGREE 1333 mother 

GM07004 Lymphoblast CEPH/UTAH PEDIGREE 1333 son 

GM07052 Lymphoblast CEPH/UTAH PEDIGREE 1333 son 

GM06982 Lymphoblast CEPH/UTAH PEDIGREE 1333 son 

GM07011 Lymphoblast CEPH/UTAH PEDIGREE 1333 daughter 

GM07009 Lymphoblast CEPH/UTAH PEDIGREE 1333 son 
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GM07678 
GM07026 
GM07679 
GM07049 
GM070Q2 
GM07017 
GM07341 
GM11820 

GM07029 

GM07019 

GM07062 

GM07053 

GM07008 

GM07040 

GM07342 

GM07027 

GM06994 

GM07000 

GM07022 . 

GM07056 

GM11821 

GM07349 

GM07348 

GM07350 

GM07351 

GM07352 

GM07353- 

GM07354 

GM07355 

GM07356 

GM07347 

GM07346 

GM07357 

GM07345 



Lymphoblast 
Lymphoblast 
Lymphoblast 
Lymphoblast 
Lymphoblast 
Lymphoblast 
Lymphoblast 
Lymphoblast 



CEPH/UTAH PEDIGREE 1333 son 
CEPH/UTAH PEDIGREE 1333 son 
CEPH/UTAH PEDIGREE 1333 son 
CEPH/UTAH PEDIGREE 1333 paternal grandfather 
CEPH/UTAH PEDIGREE 1 333 paternal grandmother 
CEPH/UTAH PEDIGREE 1333 maternal grandfather 
CEPH/UTAH PEDIGREE 1333 maternal grandmother 



CEPH/UTAH PEDIGREE 1333 



Lymphoblast CEPH/UTAH PEDIGREE 1 340 
Lymphoblast CEPH/UTAH PEDIGREE 1340 
Lymphoblast CEPH/UTAH PEDIGREE 1340 
Lymphoblast CEPH/UTAH PEDIGREE 1340 
Lymphoblast CEPH/UTAH PEDIGREE 1340- 
Lymphoblast CEPH/UTAH PEDIGREE 1340 
Lymphoblast CEPH/UTAH PEDIGREE 1340 
Lymphoblast CEPH/UTAH PEDIGREE 1340 
Lymphoblast CEPH/UTAH PEDIGREE 1340 
Lymphoblast CEPH/UTAH PEDIGREE 1340 
Lymphoblast CEPH/UTAH PEDIGREE 1340 
Lymphoblast CEPH/UTAH PEDIGREE 1340 
Lymphoblast CEPH/UTAH PEDIGREE 1340 



daughter 

father 
mother 
daughter 
daughter 

son 

son 

son 

son 

paternal grandfather 
paternal grandmother 

maternal grandfather 
maternal grandmother 



Lymphoblast CEPH/UTAH PEDIGREE 1345 
Lymphoblast CEPH/UTAH PEDIGREE 1345 
Lymphoblast CEPH/UTAH PEDIGREE 1345 
Lymphoblast CEPH/UTAH PEDIGREE 1345 
Lymphoblast CEPH/UTAH PEDIGREE 1345 
Lymphoblast CEPH/UTAH PEDIGREE 1345: 
Lymphoblast CEPH/UTAH PEDIGREE 1345 
Lymphoblast CEPH/UTAH PEDIGREE- 1345 
Lymphoblast CEPH/UTAH PEDIGREE 1345 
Lymphoblast CEPH/UTAH PEDIGREE 1345 
Lymphoblast CEPH/UTAH PEDIGREE 1345 
Lymphoblast CEPH/UTAH PEDIGREE 1345 
Lymphoblast CEPH/UTAH PEDIGREE 1345 



son 

father 
mother 
daughter 

son 

son 

son 
daughter 

son 

son 



paternal grandfather 
paternal grandmother 

maternal grandfather 
maternal grandmother 
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CLAIMS 

We claim: 

1 . A method of haplotyping comprising the steps of identifying at least two 
SNPs within a genomic region; obtaining a DNA sample; amplifying the 
DNA sample comprising more than one copy of the genomic DNA with at 
least two primer pairs each capable of amplifying a different region flanking 
the at least two SNPs in separate reactions and individually genotyping the 
amplified DNA; diluting the DNA sample so that substantially only one copy 
of a genomic DNA is present in the dilution; amplifying the diluted DNA 
sample using the same primer pairs and performing a multiplex genotyping of 
the amplified DNA; after genotyping, determining the haplotype of the one 
copy of the genomic DNA from the single genomic DNA copy dilution. 

2. A method of haplotyping comprising obtaining a genomic DNA sample; 
diluting the DNA sample so that substantially only one copy of a genomic 
DNA is present in the dilution; identifying at least two SNPs within a genomic 
region; amplifying the diluted DNA sample in a multiplex reaction using 
primer pairs designed to amplify the genomic regions flanking the SNPs; and 
determining the haplotype of the copy of the genomic DNA by multiplex 
genotyping. 

3- The method of claim 2, wherein the genomic DNA sample is genotyped, in a 
separate reaction using more than one copy of the genomic DNA. 

4. The method of any of the preceding claims, wherein the genotyping is 
performed using primer extension, terminator nucleotides and mass 
spectrometry. 
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Dilution 
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Dilution into single molecule physically 
separates two haplotypes and simultaneous 
genotyping of all three SNPs with multiplex 

assay directly determines the haplotype 
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